Search results for "DNA sequences"
showing 5 items of 5 documents
Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences
2018
Several recent works have shown that the k-mer representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive as k grows, because the feature space has exponential dimension. This issue significantly affects the classification task performed by a general machine learning algorithm used for sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k-mers associated with a set of DNA sequences, for the final purpose of nucleosome/linker classifi…
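As a rough illustration of the idea in this abstract, the sketch below builds a k-mer count vector (whose length is 4**k, hence the exponential growth) and ranks features by a simple score. The function names `kmer_counts` and `rank_features`, and the scoring rule (absolute difference of class means), are assumptions chosen for illustration; the abstract does not specify the ranking criterion the paper uses.

```python
from collections import Counter
from itertools import product


def kmer_counts(seq, k):
    """Count overlapping k-mers in seq over the full 4**k vocabulary."""
    vocab = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return vocab, [counts[kmer] for kmer in vocab]


def rank_features(X, y):
    """Return feature indices sorted by a hypothetical relevance score.

    The score is the absolute difference between the mean of each
    feature in class 1 and in class 0 (an illustrative stand-in, not
    the method from the paper).
    """
    pos = [row for row, label in zip(X, y) if label == 1]
    neg = [row for row, label in zip(X, y) if label == 0]
    scores = []
    for j in range(len(X[0])):
        mean_pos = sum(r[j] for r in pos) / len(pos)
        mean_neg = sum(r[j] for r in neg) / len(neg)
        scores.append(abs(mean_pos - mean_neg))
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


if __name__ == "__main__":
    vocab, vec = kmer_counts("ACGTACGT", 2)
    print(len(vocab))  # vocabulary size is 4**2
```

Keeping only the top-ranked indices would shrink the 4**k-dimensional vector before it is passed to a classifier.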
AnABlast: Re-searching for Protein-Coding Sequences in Genomic Regions
2019
AnABlast is a computational tool that highlights protein-coding regions within intergenic and intronic DNA sequences which escape detection by standard gene prediction algorithms. DNA sequences with small protein-coding genes or exons, complex intron-containing genes, or degenerated DNA fragments are efficiently targeted by AnABlast. Furthermore, this algorithm is particularly useful in detecting protein-coding sequences with nonsignificant homologs to sequences in databases. AnABlast can be executed online at http://www.bioinfocabd.upo.es/anablast/ .
Aporocotyle mariachristinae n. sp., and A. ymakara Villalba & Fernández, 1986 (Digenea: Aporocotylidae) of the pink cusk-eel, Genypterus blacodes (Ophid…
2012
Aporocotyle mariachristinae n. sp. and A. ymakara Villalba & Fernández, 1986 were collected from the bulbus arteriosus and ventral aorta of pink cusk-eels, Genypterus blacodes (Forster, 1801) from Patagonia, Argentina. A. mariachristinae n. sp. can be distinguished from all the species of Aporocotyle by the asymmetrical extension of posterior caeca (right posterior caecum longer, terminating at the area between mid-level of ovary and posterior body end; left posterior caecum shorter, terminating at the area between mid-level of cirrus sac and posterior to reproductive organs), the distribution of spines along the ventro-lateral body margins and the number of testes. The new species clearly …
Molecular markers indicate the phylogenetic identity of southern Brazilian sea asparagus: first record of Salicornia neei in Brazil
2019
Molecular phylogenetic analyses based on ETS, ITS and atpB-rbcL spacer sequences assessed the phylogenetic status of the southern Brazil sea asparagus species of the genus Salicornia (Salicornioideae, Amaranthaceae). Accessions of Patos Lagoon estuary (32° S) were obtained from wild plants and two pure-line lineages, selected from contrasting prostrate (BTH1) and decumbent (BTH2) ecomorphotypes found locally. Patos Lagoon wild plants, BTH1 and BTH2 f4 progenies showed 100% identical sequences for the atpB-rbcL and ITS spacers, and only two mutations for ETS. Comparison of the sequences of these three markers with GenBank records confirmed the identity of Brazilian accessions as Sal…
Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results
2009
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time-consuming procedure, and the resulting dissimilarity is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects, and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand whether the two sets of distances are similar.
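For readers unfamiliar with the normalised compression distance, a minimal sketch follows. It uses the standard definition NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed size. Using zlib as the compressor is an assumption made here for convenience; the abstract does not say which compressor the paper uses, and the helper names are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of data after zlib compression at the highest level."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalised compression distance between two byte strings.

    zlib stands in for the compressor C (an assumption); no sequence
    alignment is required.
    """
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    a = b"ACGTTGCAAGGCTTAACCGGTTAA" * 40
    b = b"TTGACCATGGACCTTAGGACATGC" * 40
    print(ncd(a, a), ncd(a, b))
```

Pairwise values computed this way can fill a distance matrix for clustering, which is the kind of comparison the abstract describes. With a real compressor the values are approximate, so ncd(x, x) is small but generally not exactly zero.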